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ABSTRACT: A high-throughput screen of the NIH molecular libraries 
sample collection and subsequent optimization of a lead dipeptide-like series of _ 
severe acute respiratory syndrome (SARS) main protease (3CLpro) inhibitors 
led to the identification of probe compound ML188 (16-(R), (R)-N-(4-(tert- 
butyl)phenyl)-N-(2-(tert-butylamino )-2-oxo-1-(pyridin-3-yl) ethyl) furan-2-car- 
boxamide, Pubchem CID: 46897844). Unlike the majority of reported 
coronavirus 3CLpro inhibitors that act via covalent modification of the 
enzyme, 16-(R) is a noncovalent SARS-CoV 3CLpro inhibitor with moderate 
MW and good enzyme and antiviral inhibitory activity. A multicomponent Ugi 
reaction was utilized to rapidly explore structure—activity relationships within 
S;, S$, and S, enzyme binding pockets. The X-ray structure of SARS-CoV 


3CLpro bound with 16-(R) was instrumental in guiding subsequent rounds of 
chemistry optimization. 16-(R) provides an excellent starting point for the further design and refinement of 3CLpro inhibitors 


that act by a noncovalent mechanism of action. 


M@ INTRODUCTION 


Coronaviruses (CoV) are enveloped, large plus-strand RNA 
viruses that cause medical disorders such as the common cold, 
lower respiratory tract infections, and diarrhea.’ The first 
characterized human CoV strains, 229E and OC43, were 
identified and studied extensively from 1965 to the mid-1980s.” 
In 2003, the novel SARS-CoV was identified** as the etiological 
agent of the global pandemic of severe acute respiratory 
syndrome (SARS), an atypical pneumonia that led to 
progressive respiratory failure in over 8000 individuals and 
800 deaths by July of that year.” The death rate of 
approximately 10% for the SARS virus and its ability to spread 
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person-to-person via respiratory droplets make it particularly 
deadly among worldwide pandemic threats. With the 
cooperation of leading nations, a rigorous public healthcare 
campaign was fortunately successful in controlling the outbreak. 
However, a reemergence of the SARS-CoV is still considered a 
potential pandemic risk and potentially new strains of SARS 
could be more severe than that found from the 2003 outbreak. 
Since 2003, two additional human coronaviruses, NL63 and 
HKU1, have been identified in patients around the world, and 
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Figure 1. Representative first generation peptidic 3CLpro inhibitors 1—7 highlighting reactive warhead groups (red) and side chain residues (P, 


blue). 
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Figure 2. Representative second generation nonpeptidic 3CLpro inhibitors highlighting potential warhead groups (red). 


the viruses have been characterized and found to be 
significantly less lethal than SARS-CoV.° * Most recently, a 
new SARS-like virus, called HCoV-EMC, has been identified in 
at least two individuals, one of whom died.” Sequence analysis 
of HCoV-EMC indicates that this virus is more closely related 
to bat coronaviruses than to SARS-CoV. Therefore, the 
possibility of a future SARS-like pandemic remains possible, 
and to date, there are still no vaccines or antiviral agents 
available to prevent or treat SARS-like infections. 

The SARS-CoV genome encodes a large polyprotein that is 
proteolytically processed by two cysteine proteases including 
the 3C-chymotrypsin-like protease (3CLpro) and the papain- 
like protease (PLpro). 3CLpro is essential for proteolytic 
processing at 11 different cleavage sites within the coronavirus 
polyprotein and is thus vital for viral replication.'° The 3CLpro 
enzyme exists primarily as a dimer in solution, and the dimer 
has been confirmed to be the active species for the enzyme 


reaction.'' The cloning and expression of recombinant SARS 


3CLpro,’” along with studies showing that 3CLpro is essential 
for the viral life cycle,’ support a role for 3CLpro as an 
important pathogenic component of SARS-CoV and therefore 
a viable target for antiviral drug development. 

The SARS-CoV 3CLpro has three domains: I (residues 8— 
101), IL (residues 102—184), and III (residues 201—301). 
Domains I and I, which contain the active site region, are /- 
barrel domains, and domain III is an a-helical domain. The 
active site contains a catalytic dyad consisting of a cysteine 
residue (Cys-145) that acts as a nucleophile and a histidine 
residue (His-41) that acts as the general acid—base. Optimized 
octapeptide-based inhibitors using mutational and CoMFA 
models have been reported,'* and more recently, a systematic 
saturation mutagenesis study was conducted at the PS through 
P3’ positions of the substrate.'° These results demonstrate a 
strong structure—activity relationship between 3CLpro and its 
substrate and have provided a basis for peptidomimetic 
inhibitor design. X-ray structures of the SARS-CoV 3CLpro 
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enzyme bound to hexapeptidyl chloromethyl ketone inhibitors 
were first reported,'® '* and numerous peptidic structures now 
exist in the context of targeted antiviral drug design.'” ** These 
first generation protease inhibitors maintain a peptidic nature, 
often five residues in length, and bear a reactive warhead group 
at the terminus that forms a covalent interaction with Cys-145 
(Figure 1, 1—7). Reactive “warhead” groups for 3CLpro have 
included aldehydes, epoxy-ketones, halo-methyl ketones, 
trifluoromethyl ketones, and a number of examples of Michael 
acceptors.” > These inhibitors often first form a noncovalent 
interaction complex with the enzyme, positioning the warhead 
in close proximity to the catalytic cysteine. Attack of the 
thiolate anion of the catalytic cysteine onto the reactive atom of 
the warhead leads to formation of the covalent adduct, 
inactivating the enzyme. One of these compounds, TG- 
0205221 (5), reacts with SARS 3CLpro with a reported K; 
value of ~60 nM.” 

These first generation inhibitors achieved submicromolar 
activity and provided valuable insights into further structure- 
based inhibitor design, quickly leading to nonpeptidic, warhead- 
based small molecule inhibitors (Figure 2, 8—11).'%*°* 
Efforts utilizing virtual screening approaches also proved 
successful leading to nonpeptidic inhibitors (e.g. 12 and 
13).°”°° One of the early compounds disclosed bearing a 
cinnamyl amide, cinanserin (Figure 2, 8), has a reported IC«o 
value of ~5 wM.”° Low molecular weight nonpeptide inhibitors 
bearing reactive esters (9-10b)*”— ? and ketone moieties 
(11a,b)°"*" have demonstrated moderate to good micromolar 
inhibition and in the case of pyridyl ester 10b~ achieved 
inhibition in a cell-based assay below 10 uM. 

The utility and development of using covalently bound 
inhibitors can unfortunately be limited due to their potential for 
off-target side effects and toxicity.*° Despite the fact that there 
are currently 39 commercially available therapeutics that act 
through covalent-modification mechanisms, none of these 
target cysteine proteases.°° The fact that all of the protease 
classes (serine, threonine, aspartyl, and metallo), with the 
exception of cysteine, have been targeted by marketed 
therapeutics suggests that advancing covalent cysteine-protease 
inhibitors to market may have unique challenges.*’ Therefore, 
researchers are now focusing more attention on the 
identification of noncovalent inhibitors. Legitimate noncovalent 
inhibitors described to date have been limited to high molecular 
weight peptidomimetics (MW > 800 amu) with low ligand 
efficiency’’*” and have included known protease drugs 
originally developed for HIV protease.*° In an effort to identify 
small molecule inhibitors of the SARS 3CLpro that inhibit via a 
noncovalent mechanism and possess low micromolar potency 
and good cell permeability, a screen was conducted in 2009 
against the NIH molecular libraries sample collection 
(~293000 compounds) in collaboration with the Mesecar 
laboratory and the Scripps Research Institute Molecular 
Screening Center (SRIMSC) through the molecular libraries 
program center network (MLPCN).”* Herein we describe the 
results of this screening campaign and the prosecution of an 
optimization effort at the Vanderbilt Specialized Chemistry 
Center (VSCC) around a class of 2-(N-arylamido )-2-(pyridin- 
3-yl) acetamide inhibitors of moderate molecular weight. In 
addition, from these collaborative efforts and with the aid of X- 
ray crystallography, we report the details of the molecular 
interaction of 3CLpro inhibitor 16-(R) with the active site of 
3CLpro and further describe the physicochemical and ancillary 
properties of 16-(R). 


M RESULTS AND DISCUSSION 


SARS-CoV 3CL Protease Construct Used in Assays and 
Lack of Inhibition by Cinanserin. The 3CLpro enzyme, also 
known as nonstructural protein 5 (nspS), is liberated from 
coronavirus polyproteins by self-cleavage at its N-terminal 
(nsp4/5) and C-terminal (nsp5/6) cleavage sites. Although the 
autocatalytic mechanism is still being elucidated, current studies 
suggest that it involves an initial dimerization event between 
two unreleased nspSs within the polyprotein followed by a 
series of catalytic steps that liberate a fully active, dimeric 
enzyme that can then proceed to processing the remaining 9 
cleavage sites of the polyprotein.” The SARS-CoV 3CLpro is 
highly active as a dimer and is essentially inactive as a 
monomer.” The addition of affinity tags or additional amino 
acids to the N- and C-termini of 3CLpro is found to 
substantially increase the dimer dissociation constant (Kj) 
and dramatically decrease the enzymatic activity of the 
enzyme.” Therefore, we expressed and purified SARS-CoV 
3CLpro from a newly designed expression construct that 
produces the authentic 3CLpro dimer that would be generated 
in a virus infected cell. The enzyme was purified by a multistep 
chromatographic procedure and was found to be highly active. 

Because the compound cinanserin was reported to be a 
highly effective noncovalent inhibitor of SARS-CoV and 
HCoV-229E 3CLpros, with ICs, values of 5 “4M against each 
enzyme, ° we sought to use this compound as a control 
inhibitor to calculate Z’-factors during our high-throughput 
screening of compound libraries. We tested the ability of 
cinanserin to inhibit our authentic version of SARS-CoV 
3CLpro at a concentration of 100 #M, and surprisingly, we 
found no inhibition of the enzyme. Next, we tested the ability 
of cinanserin to inhibit the 3CLpro enzymes from the human 
coronavirus HKU1 and the mouse coronavirus MHV. Once 
again, we found no inhibition of these 3CL proteases at a 
cinanserin concentration of 100 uM. 

Since our kinetic inhibition results stand in contrast to those 
of Chen and co-workers,*° we sought an explanation for this 
large discrepancy in potency. Upon closer examination of the 
reported methods used to determine the IC., values for 
cinanserin, we found that the 3CLpro expression construct 
utilized by Chen and co-workers incorporated a (His),-affinity 
tag at the N-terminus that extended it by an additional 14 
residues.** The N-terminal affinity tag was never removed from 
the purified 3CLpro enzymes prior to their kinetic studies. As 
described above, the addition of N-terminal or C-terminal 
residues to SARS 3CLpro is known to have a dramatic effect in 
increasing the dissociation constant for dimerization of the 
enzyme, as well as decreasing the enzymatic activity of the 
enzyme. Therefore, cinanserin may be inhibiting a non- 
dimeric form of the enzyme. Another possible explanation is 
that Triton X-100 was not included in the assay buffer to 
remove potential promiscuous inhibition caused by nonspecific 
aggregation of the protein by the compounds.* We tested 
cinanserin for inhibition of SARS-CoV, HKUI1, and MHV 
3CLpro at a concentration of 100 uM cinanserin in both the 
presence and absence of 0.01% Triton X-100 and found no 
inhibition of the enzymes in either case. Ultimately, we were 
unable to use cinanserin as a control inhibitor since no 
inhibition of SARS-CoV, HKU1, or MHV 3CL proteases was 
observed. We therefore used compound 10a (Figure 2), a 
covalent modifier, as a positive control compound. 
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Figure 3. (left) Hit furyl amide 14 (SID49730186); (right) inhibition concentration curve for 3CLpro (gray squares) and PLpro (black squares) and 
14. 
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Screening Campaign and Lead Identification at the 
SRIMSC. High-throughput screening was conducted by 
evaluating the inhibition of the 3CLpro-mediated peptide 
cleavage event using a novel FRET-based substrate (Pubchem: 
AID 1859). We utilized a number of key assay features in the 
screening campaign that have significant advantages over 
previous HTS campaigns against the 3CL enzyme.*° First, we 
utilized a SARS-CoV 3CLpro construct that consisted of the 
native sequence of the enzyme that represents the in vivo 
postproteolytic form of the enzyme. As discussed above, our 
version of the enzyme does not contain tags or excess amino 
acids that are known to drastically reduce catalytic activity of 
the enzyme.’ Second, we maintained a high concentration of 
fresh reducing agent in the primary assays, which helped to 
minimize but did not eliminate covalent compounds as primary 
leads. Finally, we utilized a highly fluorescent FRET-based 
3CLpro peptide substrate composed of a HiLyte fluor 488 
fluorescent label attached at the N-terminus and a QXL520 
quencher dye attached to the C-terminus. This substrate allows 


for excitation in the spectral region of 488 nm that helps to 
eliminate spectral overlap with the compounds in the screen. 
Moreover, the high fluorescence quantum yield of the 
fluorescent probe and sufficiently low turnover number of the 
enzyme yields a highly sensitive assay that is amenable to 
screening in 1536 plate format with low («K,,) substrate 
concentrations. 

The primary screen of the molecular libraries small molecule 
repository (~293,000 compounds at the time of screen) was 
conducted using a 6 “M concentration of each compound. The 
primary screen produced 406 hits with greater than 12.25% 
inhibition. An excellent Z’ score of 0.92 and a relatively low hit- 
rate of 0.14% were observed. Among the 406 hits, 380 were 
available for reorder and were tested in a confirmation screen 
(singles at 10 4M). This second screen produced 136 active 
compounds with greater than 12.25% inhibition. A subsequent 
dose—response testing of 101 compounds provided 44 actives 
with IC. values below 10 “~M. A number of these actives were 
then counter-screened against the SARS-CoV papain-like 
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Figure 4. Selected SAR highlights from first round Ugi library. 


protease, PLpro, which is also a potential target since it too is 
vital for SARS-CoV maturation.*°*” In general, the majority of 
IC. values were substantially greater than 10 4M against PLpro 
suggesting moderate to good selectivity for 3CLpro inhibition. 
Prior to initiation of the chemistry phase of the project, 
SRIMSC performed structural clustering analysis of the 101 
confirmed actives. Several interesting scaffold clusters and a 
handful of singletons were identified from this analysis. A 
dipeptide class, represented by hit 14 (Figure 3, SID49730186), 
was one of two analogues with an IC<, below 10 uM. This furyl 
amide series was of particular interest due to its starting 
potency, modular nature, structural properties, and absence of 
reactive functionality found within typical covalent modifiers. 
Probe Optimization of Furyl Amide 14. Within the furyl 
scaffold, an initial 80 compound library array 15 was designed 
based upon HTS data using an Ugi MCC (multicomponent 
condensation) strategy (Scheme i) aca 
In the initial Ugi array, the isocyanide was held constant to 
afford a tert-butyl group at Ry. The a-aryl substituent R; was 
maintained either as a 3-pyridyl, as found in the original hit 14, 
or as a 2-thienyl, which was also identified as a tolerable group 
in another closely related dipeptide cluster from HTS. Five 
monomers were selected for the R, amine group to examine 
modifications of the aryl, and a cyclopropyl was included as a 
potential aryl replacement as a means to reduce MW and 
overall lipophilicity. For the R, carboxylic acid component, 
eight monomers were chosen focusing exclusively on 
substituted aromatics and heteroaromotics. As part of the R, 
group scan, a furan-3-carboxylic acid was included as a control 
for SAR comparisons relative to 14. All 80 compounds were 
successfully prepared and submitted for testing. 
Disappointingly, only two Ugi library analogues displayed 
inhibition levels greater than 50% at 100 uM and subsequently 
were determined to have IC<, values less than 50 ~M. Notably, 
these active analogues were exclusively within the 3-pyridyl 
subseries (Figure 4, 16 and 17). Within the 3-pyridyl series, 


replacement of the lipophilic N-aryl R, group with either a 
cyclopropyl group or a 4-fluoro phenyl derivative (Figure 4, 18 
and 19) were deleterious for inhibition, indicating the 
importance of steric bulk for optimal interaction within this 
putative subpocket of the enzyme. The 3-thienyl congener 20, a 
direct analogue of 16, resulted in a complete loss of inhibition 
activity. 

Concurrent with synthetic plans to prepare a second 
generation Ugi library, SARS-CoV 3CLpro crystals suitable 
for X-ray diffraction and analysis were obtained with inhibitor 
16 bound to 3CLpro.*” The electron density associated with 
inhibitor 16 is clearly resolved and allowed for accurate 
refinement of its position (Figure 6). The binding orientation 
of 16 is overall similar to known covalent peptidomimetic 
inhibitors, with inhibitor 16 preferentially occupying the S,—S,, 
subpockets of the 3CLpro enzyme as the R enantiomer 
(Figures S—7).°* In this orientation, the tert-butyl amide 
occupies the S; pocket, the tert-butylanilido group occupies a 
deep S, pocket, the 3-pyridyl group occupies the S,, and the 


furyl amide acts as a P, group. For comparison, the 


P2 
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Figure 5. Schematic of enzyme pockets occupied by 16 and 4. 
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Figure 6. SARS-CoV 3CLpro active site with bound inhibitor 16. (left) Wall-eye stereoview of the F,— F. electron density omit map (30) 
surrounding inhibitor 16. (right) Solvent-accessible surface view of 16-3CLpro complex (PDB code 3V3M, PubChem SID87915542). 


mechanism-based covalent inhibitor 4 (PDB code 2ALV) and 
its occupied subpockets are represented schematically in Figure 
§ relative to inhibitor 16.7’ In addition to the S;—S,, pockets 
occupied by 16, 4 occupies the S, pocket, thus relative to 4 the 
MW of 16 is reduced by over 100 amu and furthermore 16 
lacks a reactive warhead. 

An electron density map and a solvent-accessible surface 
depiction of 16 bound to 3CLpro are shown in Figure 6. A 
stereoview of the inhibitor-bound structure and associated 
interactions is also shown in Figure 7. Interestingly, the 3- 


Figure 7. X-ray crystal structure of 16 (capped sticks in green carbon) 
with SARS 3CLpro in wall-eye stereoview. Hydrogen bonds between 
inhibitor and 3CLpro are shown as black dotted lines. 


pyridyl ring nitrogen serves as a hydrogen bond acceptor within 
the S,-subpocket engaging His-163 with a N—N interatomic 
distance of 2.8 A. A similar hydrogen-bond interaction with 
His-163 engaged is retained in other 3CLpro crystal 
structures.”' The furan ring oxygen and the amide carbonyl 
oxygen have a bifurcated interaction with the backbone NH of 
Gly-143. Although the distance to the amide carbonyl is 0.5 A 
closer to Gly-143 and appears to be a hydrogen bond, based 
upon distance (C=O to NH = 2.9 A), the geometry is not 
ideal. In fact, the NH of Gly-143 is nearly coplanar with both 
oxygen atoms. On the floor of the S, pocket, the catalytic Cys- 
145 is positioned beneath both the amide carbonyl carbon and 
the furan oxygen at a distance of 3.5 A. With this rather unique 
furyl amide interaction within the context of the catalytic 
residues and backbone Gly-143 interaction, we set out to 
understand this interaction further by preparing a second 


chemical library focusing exclusively on the P,, group holding 
the P,—P, groups constant. Results from this campaign are 
summarized in Figure 8 (21—48). 

Replacement with five-membered z-excessive heterocycles 
proved most successful with 22, 27, and 31 having IC.9’s ~5S0 
HM and below. A fully saturated tetrahydrofuran (32) and a 
select set of acyclic analogues were >10-fold less active with 
ICs9s > 50 wM. Substitution on the furan ring negatively 
impacted activity (21, 26) and although replacement of the 
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furan oxygen atom with a larger sulfur atom in the related 
thiophene (27) was weakly tolerated (10-fold loss), the related 
pyrrole (37) heterocycle was completely inactive. The nearly 
equipotent inhibition by the imidazole 31 (SID93616598) 
however was encouraging, and thus a smaller subset of 
analogues was prepared to complete the survey of five- 
membered heterocycles (49-52). This effort identified 2- 
oxazolyl analogue 49 (SID99350507) with similar potency to 
the 4-oxazolyl isomer 22. Interestingly, in contrast to the 2- 
methyl furyl derivative 21, the 2-chloro-substituted furyl 
congener 50 (SID99350505) was nearly equipotent to 16 
with an ICs, of 5.2 4M. SAR around imidazole 31 proved steep 
as the N-methyl C(4) substituted analogue 52 was >10-fold less 
active and addition of a methylene spacer (51) was deleterious 
for inhibition. 

A subsequent survey of P, replacements was performed in 
order to identify alternate hydrogen bond acceptor groups that 
might engage His-163, while retaining the 2-furyl amide P,, 
group (Table 1). 


Table 1. Focused P, Group Heterocyclic Library (53—61) 


N 
O 
Compound SID P, aoe 
ICs59 uM 
=N 
53 117686897 ; <M 10 
7 
54 117686898 aX 55 
ww, 
=N 
55 117686909 ’ <)- Inactive 
56 117686901 *{ON 45 
N= . 
57 117686900 <> Inactive 
fi~NH 0 
58 117686899 << 16% (100 uM) 
59 117686904 re Inactive 
\-NH 
60 117686905 J~ NA 23% (100 uM 
= t 6 (100 uM) 
61 117686906 ed 23% (100 uM) 


#ICxp values are average of two independent runs using triplicate 
concentrations; “Inactive” defined as %inhibtion <15% at 100 uM, CV 
<0.3. 


Among the various P, replacements examined (53-61) 
pyridazine (53) and pyrazine (54) were well tolerated, although 
no improvement was found over lead compound 16. More 
dramatic modifications around the pyridyl ring were clearly 
limited for reasons consistent with the inhibitor bound X-ray 
structure because both the 4- and 2-pyridyl analogues were 
either weak inhibitors (56) or inactive ($7) and addition of a 2- 
methyl in pyridyl analogue 55 was not permitted. Attempts to 


replace the 3-pyridyl P,; with 1,2-diazoles (58, 59), 1,2,3- 
triazoles (60), or 1,3-diazoles (61) were also unsuccessful in 
identifying compounds with improved inhibition. 

We next turned to chiral stationary phase supercritical fluid 
chromatography (SFC) to separate and isolate the single 
enantiomers of 16 (Figure 9). Highly specific inhibition was 
found for single stereoisomer 16-(R) with an ICs of 1.5 + 0.3 
LM, which was identified as the first eluting peak from chiral 
SFC. In addition, inhibitor 16-(R) was found to specifically 
inhibit 3CLpro versus PLpro. Inhibitor 16-(R) was sub- 
sequently declared MLPCN probe ML188."' Inhibitor 16-(R) 
was inferred as the R stereoisomer based upon the absolute 
configuration observed in the electron density map of the X-ray 
structure of the 16-3CLpro complex, and it was found to rotate 
plane polarized light in the positive direction. Finally, the 
mechanism of inhibition of SARS 3CLpro by 16-(R) was 
determined to be competitive based on a double-reciprocal plot 
of the initial rate of the SARS 3CLpro catalyzed reaction versus 
variable concentrations of substrate and fixed, variable 
concentrations of 16-(R) (see Figure 2, Supporting Informa- 
tion). The resulting K, value was determined to be 1.6 + 0.26 
HM, which is equivalent to its IC; value and is expected since 
the IC, values for the inhibitors are determined at a substrate 
concentration that is significantly lower than the K,, value. 

SARS-CoV Antiviral Activity of 16-(R). The potency of 
16-(R) against SARS-CoV 3CLpro was deemed sufficient to 
determine the antiviral potency of the compound against SARS- 
CoV Urbani infected Vero E6 cells. Using our established 
antiviral assay and BSL3 protocols, we generated a dose— 
response curve of 16-(R) against mock-infected and SARS-CoV 
infected cells, and the results are shown in Figure 10.” 

A fit of the data presented in Figure 10 yields an antiviral 
EC, value of 12.9 + 0.7 uM. A second experiment, performed 
one month later, yielded an ECs value of 13.4 + 1.0 uM. The 
two independent experiments demonstrate that the resulting 
noncovalent inhibitor, 16-(R), can effectively inhibit SARS- 
CoV replication in cell culture. 

We searched the literature for other noncovalent inhibitors 
against SARS-CoV 3CLpro that also have antiviral data 
associated with the compound. The compound cinanserin is 
reported to have antiviral activity.*° In that report, SARS-CoV 
infected Vero cells were treated with 50 wg/mL (134 pM) 
cinanserin. The authors conducted an RNA quantitation assay 
of the cell culture media and reported a 3 log reduction and 
indicate that this reduction correlates with the titer of infectious 
particles. The authors also did a control cytotoxicity assay and 
reported that the cytotoxic concentration that gives 50% 
reduction (CC.,) is 31 uM and the CCo, is 66 wM. Given the 
fact that under these high cinanserin concentrations (134 uM) 
the majority of Vero cells (>90%) are likely killed by cinanserin, 
interpretation of the antiviral data in these studies is deemed 
ambiguous. 

We next compared the EC, of 16-(R) value to another 
noncovalent compound tested against SARS-CoV infected cells 
that targets the SARS-CoV papain-like protease (PLpro) 
known as GRLO617.*” Compound GRL0617 has an IC« 
against the PLpro enzyme of 600 nM and an EC,, value of 
14.5 wM against SARS infected Vero E6 cells.*” We next 
compared the EC.,/IC<o ratios of the noncovalent compounds, 
16-(R) and GRL0617, which are ~8.7 and ~24.1, respectively. 
This ratio can help guide compound design in terms of 
estimating what value of an ICs, needs to be achieved in order 
to achieve a pharmacologically relevant effect. Therefore, we 
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Figure 10. Vero E6 cells were mock-infected (blue diamonds) or 
infected with SARS-CoV (red squares) for 1 h prior to the addition of 
the antiviral compound 16-(R). At 48 h postinfection, cell viability was 
determined using Cell Titer-Glo luminescent cell viability assay 
(Promega). The error bars represent the standard deviation of 
triplicate samples. Experiments were performed in duplicate on 
separate days. 


compared these values to the most potent covalent 3CLpro 
inhibitor identified to date, TG-0205221 (5).”* This compound 
has an EC.,/K, value of 0.6 4M/0.052 “M or ~11.5. Thus, 
noncovalent and covalent inhibitors of 3CLpro can achieve 
similar antiviral effects. 

Based upon the structural information, excellent 3CL 
protease inhibition activity, and SARS-CoV Ubani antiviral 
activity, 16-(R) was elected as a first in class probe candidate 
from the furyl amide series. Probe 16-(R) was found to be 
highly selective for 3CLpro versus PLpro, and in a Ricerca lead- 
profiling screen” against 68 discrete GPCRs, ion channels and 
transporters, no significant activity was found (<S0% at 10 
uM). In PBS buffer at neutral pH, probe 16-(R) was found to 
have excellent aqueous solubility up to 95 wg/mL or 219 uM. 
Extended characterization of probe inhibitor 16-(R) and 
analogues against other cysteine proteases and in SARS-CoV 
infected cells as well as DMPK studies to assess metabolic 
stability and plasma protein binding are ongoing. 


M@ CONCLUSIONS 


In summary, we have described the optimization and molecular 
interaction details for a potent, noncovalent inhibitor of SARS- 
CoV 3CLpro identified through the MLPCN initiative. A four- 
component Ugi reaction was utilized to rapidly generate SAR 
and optimize potency focusing on the S,,, S,, and S, binding 
pockets. X-ray analysis of the 3CLpro—16-(R) complex reveals 


[a]p = +38.9 (c = 1.0, CHCl) 
3CLpro ICsp = 1.540.3 uM 
PLpro Inactive 


several key interactions crucial for activity; in particular, a 
hydrogen bonding interaction between the inhibitor 3-pyridyl 
ring nitrogen and the active site His-163 side-chain located 
within the S,-subpocket. Probe 16-(R) (ML188) is a modest 
molecular weight SARS-CoV 3CLpro inhibitor with demon- 
strated antiviral activity and a noncovalent mechanism of action 
and thus provides the opportunity to facilitate further structure- 
based design and inhibitor refinement in the quest for potential 
novel anti-SARS CoV therapies. Collaborative efforts in these 
laboratories continue toward the identification of submicromo- 
lar 3CLpro inhibitors with optimal properties for antiviral 
activity and potential testing in animal models. ML188 is an 
MLPCN probe and is freely available upon request. 


Mm EXPERIMENTAL SECTION 


General. All NMR spectra were recorded on a Bruker 400 and 600 
MHz instrument. 'H chemical shifts are reported in 6 values in ppm 
downfield from TMS as the internal standard in d;-MeOH or CDCl,. 
Data are reported as follows: chemical shift, multiplicity (s = singlet, d 
= doublet, t = triplet, q = quartet, br = broad, m = multiplet), 
integration, coupling constant (Hz). Low-resolution mass spectra were 
obtained on an Agilent 1200 series 6130 mass spectrometer. High- 
resolution mass spectra were recorded on a Waters Q-TOF API-US. 
Analytical thin layer chromatography was performed on Analtech silica 
gel GF 250 ym plates. Analytical HPLC was performed on an HP1100 
with UV detection at 214 and 254 nm along with ELSD detection, LC- 
MS (J-Sphere80-C18, 3.0 mm X 50 mm, using either a 1.1 or 3.1 min 
gradient, 5%[0.05%TFA/CH;CN]/95%[0.05%TFA/H,O] to 
100%[0.05%TFA/CH,CN]. Preparative RP-HPLC purification was 
performed on a custom HP1100 automated purification system with 
collection triggered by mass detection or using a Gilson Inc. 
preparative UV-based system using a Phenomenex Luna C18 column 
(SO mm X 30 mm LD., 5 wm) with an acetonitrile (unmodified)— 
water (0.1% TFA) custom gradient. Normal-phase silica gel 
preparative purification was performed using an automated Combi- 
flash companion from ISCO. Solvents for extraction, washing, and 
chromatography were HPLC grade. All reagents were purchased from 
Aldrich Chemical Co. and were used without purification. All polymer- 
supported reagents were purchased from Argonaut Technologies and 
Biotage. 

General Procedure for 4CC-Ugi Reaction in Library Format. 
To a series of 13 mm X 100 mm screw top glass tubes fitted with a 
magnetic stir bar equimolar amounts (0.08 mmol) of carboxaldehyde, 
amine, and carboxylic acid were combined in methanol (0.2 M, 1.5 
mL) and subsequently treated with tert-butylisocyanide (0.08 mmol). 
The mixture was stirred for 16 h at ambient temperature and then 
concentrated under a stream of nitrogen in a well-ventilated hood. The 
crude mixtures were reconstituted in MeOH, treated with Argoresin 
MP-Trisamine (Biotage Inc.) scavenger for 2 h and applied to a Celite 
pad using a manifold fitted with polypropylene filter tubes capable of 
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filtering 24 samples in parallel. The filtrates were purified directly using 
an automated mass-guided RP-HPLC, and product containing 
fractions were concentrated to give final products >95% purity as 
judged by LC-MS (215 nm and ELSD) and 'H NMR (representative 
library members >10%). All final library array product characterization 
data, 'H NMR spectra, and SARS-3CLpro % inhibition at 100 ~M can 
be found in Supporting Information. 

Analogues 21—48 and 49-61 were prepared in an analogous 
manner and purified by Gilson Inc. preparative UV-based RP-HPLC. 
Chromatographic, LC-MS data, and representative 'H and *C NMR 
spectra for these remaining analogues can be found in Supporting 
Information. 'H NMR data for 20% of these libraries have been 
provided. 

Preparation of (R)-N-(4-(tert-butyl)phenyl)-N-(2-(tert-butyla- 
mino)-2-oxo-1-(pyridin-3-yl)ethyl)furan-2-carboxamide (16- 
(R), ML188). To a 20 mL scintillation, equimolar amounts (0.5 
mmol) of pyridine-3-carboxaldehyde, 4-tert-butylaniline, and furan-2- 
carboxylic acid were combined in methanol (0.2M, 2.5 mL) and 
treated with tert-butylisocyanide (0.5 mmol). The mixture was stirred 
for 16 h at ambient temperature and then concentrated and applied as 
a DCM solution to a silica column (12 g) and purified over a gradient 
of 0 to 100% EtOAc in hexanes. The racemic product was obtained as 
a light yellow solid (198 mg, 91%): LC-MS (>98%) m/z = 434 [M + 
H]. Separation of enantiomers was accomplished using supercritical 
fluid chromatography. The column was an IA (UV 250 nm, 10 mm X 
250 mm, Chiral Technologies), eluent 6% MeOH in CO,. The desired 
peak as confirmed by the primary assay was identified as the first 
eluting peak, retention time = 2.91 min (CID 46897844): [a]p = 
+38.9 (c = 1.0, CHCI,). ‘H NMR (400 MHz, CDCI,) 6 8.51 (1H, d, J 
= 2), 8.49 (1H, dd, J = 4.8, 2), 7.53 (1H, dt, J = 8, 2), 7.41 (1H, d, J= 
1), 7.29 (2H, d, J = 6.6), 7.23 (2H, d, J = 6.6), 7.09 (1H, dd, J = 8, 4.8), 
6.18 (1H, dd, J = 3.6, 2), 6.12 (1H, s), 5.41 (1H, d, J = 3.6), 1.40 (9H, 
s), 1.28 (9H, s). *C NMR (400 MHz, CDCI,) 6 167.7, 159.5, 152.4, 
151.3, 149.4, 146.1, 144.9, 138.1, 136.4, 130.4, 130.1, 126.0, 122.7, 
117.0, 111.1, 63.6, 51.7, 34.6, 31.2, 28.6. HRMS (ESI, M + H) calcd 
for Cy,H,N3;O, 434.2440, found 434.2444. 

N-(2-(tert-Butylamino)-2-o0xo-1-(pyridin-3-yl)ethyl)-N-(4- 
isopropylphenyl)furan-2-carboxamide, 17. 'H NMR (400 MHz, 
CDCl,) 6 8.46 (2H, m), 8.49 (1H, d, J = 7.9 Hz), 7.38 (1H, s), 7.07 
(3H, m), 6.98 (2H, bs), 6.15 (2H, m), 6.10 (1H, s), 5.38 (1H, d, J = 
3.6 Hz), 2.87 (1 H, p, J = 6.9 Hz), 1.37 (9H, s), 1.20 (6H, d). °C 
NMR (100 MHz, CDCI,) & 167.8, 159.5, 151.4, 150.0, 149.4, 146.1, 
144.8, 138.0, 136.6, 130.5, 130.4, 127.0, 122.7, 117.0, 111.1, 63.5, 51.6, 
33.6, 28.5, 23.8, 23.7. HRMS (ES+, M + H) calcd for C,<H3)N30; 
420.2287, found 420.2290. 

N-(2-(tert-Butylamino)-2-oxo- 1-(pyridin-3-yl)ethyl)-N-cyclopro- 
pylfuran-2-carboxamide, 18. 'H NMR (400 MHz, CDCI,) 6 8.68 
(1H, s), 8.57 (1H, d, J = 4.0 Hz), 7.97 (1H, d, J = 7.9), 7.53 (1H, s), 
7.32 (1H, dd, J = 7.7, 4.8), 7.10 (1H, d, J = 3.4 Hz), 6.57 (1H, s), 6.51 
(1H, dd, J = 3.4, 1.6 Hz), 5.66 (1H, s), 2.90 (1H, m), 1.36 (9H, s), 
0.99 (1H, m), 0.80 (m, 1H), 0.68 (m, 2H). °C NMR (100 MHz, 
CDCl) 6 168.3, 162.4, 150.5, 148.9, 147.2, 144.5, 137.1, 131.1, 122.8, 
117.1, 111.2, 64.7, $1.1, 31.2, 28.2, 10.4, 9.5. HRMS (ES+, M + H) 
calcd for C))H,4,N30, 342.1818, found 342.1815. 

N-(2-(tert-Butylamino)-2-o0xo-1-(pyridin-3-yl)ethyl)-N-(4- 
fluorophenyl)furan-2-carboxamide, 19. 'H NMR (400 MHz, 
CDCI,) 5 8.41 (1H, d, J = 4.0 Hz), 8.37 (1H, s), 7.43 (1H, d, J = 
8.0), 7.29 (1H, s), 7.05 (2H, dd, J = 7.8, 4.8), 6.85 (2H, m), 6.29 (1H, 
s), 6.15 (1H, s), 6.13 (1H, dd, J = 3.5, 1.6 Hz), 5.55 (1H, d, J = 3.6 
Hz), 5.24 (s, 1H), 1.30 (9H, s). °C NMR (100 MHz, CDCI,) 6 167.8, 
162.5 (d, J = 248), 159.3, 151.3, 149.5, 146.1, 144.9, 137.8, 134.9 (d, J 
= 3.1 Hz), 132.9 (d, J = 8.6 Hz), 130.3, 123.0, 117.2, 115.8 (d, J = 22.4 
Hz), 111.1, 62.8, 51.7, 28.5. HRMS (ES+, M + H) calcd for 
Cy5H,,N303F 396.1723, found 396.1725. 

N-(4-(tert-Butyl)phenyl)-N-(2-(tert-butylamino)-2-0xo-1-(thio- 
phen-3-yl)ethyl)furan-2-carboxamide, 20. 'H NMR (400 MHz, 
CDCI,) 6 7.37 (1H, s), 7.26 (2H, s), 7.24 (1H, s), 7.13 (1H, m), 6.99 
(2H, bs), 6.89 (1H, d, J = 5.0 Hz), 6.13 (1H, m), 6.10 (1H, s), 6.06 
(1H, s) 5.33 (1H, d, J = 3.6 Hz), 1.35 (9H, s), 1.29 (9H, s). "C NMR 
(100 MHz, CDCl,) 6 168.1, 159.3, 152.0, 146.4, 144.6, 137.2, 134.9, 


129.6, 129.0, 126.7, 125.8, 125.1, 116.7, 111.1, 61.6, $1.4, 34.6, 31.3, 
28.6. HRMS (ES+, M + H) calcd for C,;H3,;N,03S 439.2055, found 
439.2057. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)oxazole-5-carboxamide, 22. 'H NMR (400 MHz, CDCI,) 
6 8.48 (2H, s), 7.84 (1H, s), 7.48 (1H, d, J = 7.9 Hz), 7.28 (2H, m), 
7.08 (1H, m), 6.06 (1H, s), 5.85 (1H, bs), 5.70 (1H, s), 5.30 (1H, s) 
1.37 (9H, s), 1.27 (9H, s). °C NMR (100 MHz, CDCI,) 6 167.3, 
157.7, 152.8, 152.1, 151.2, 149.3, 144.1, 137.7, 134.9, 131.1, 130.3, 
129.9, 125.9, 122.7, 62.9, $1.5, 34.4, 30.9, 28.3. HRMS (ES+, M + H) 
calcd for C;;H3;N,O03 435.2396, found 435.2397. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)thiophene-2-carboxamide, 27. 'H NMR (400 MHz, 
CDCl,) 6 8.47 (1H, s), 8.44 (1H, d, J = 4.0 Hz), 7.48 (1H, d, J = 
8.0 Hz), 7.29 (1H, dd, J = 4.8, 0.9 Hz), 7.23 (2H, d, J = 8.3 Hz), 7.05 
(1H, dd, J = 7.9, 4.8 Hz), 7.01 (1H, bs), 6.75 (2H, m), 6.29 (1H, bs), 
6.14 (1H, s), 1.36 (9H, s), 1.26 (9H, s). °C NMR (100 MHz, CDCl,) 
5 167.9, 162.9, 152.6, 151.4, 149.4, 138.0, 137.5, 136.5, 133.1, 131.6, 
130.6, 130.5, 126.7, 126.1, 122.7, 64.1, 51.7, 34.6, 31.2, 28.6. HRMS 
(ES+, M + H) calcd for C,,H3,N;0,S 450.2215, found 450.2213. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)-1H-imidazole-4-carboxamide, 31. ‘'H NMR (600 MHz, 
CDCl,) 6 11.666 (1H, bs), 8.47 (1H, d, J = 1.9 Hz), 8.41 (1H, dd, J = 
3.2, 1.4 Hz), 7.59 (2H, s), 7.39 (1H, d, J = 5.3 Hz), 7.22 (3H, bs), 7.01 
(1H, dd, J = 5.3, 4.8 Hz), 6.32 (1H, bs), 6.20 (1H, s), 5.43 (1H, bs), 
1.24 (9H, s), 1.23 (9H, s). °C NMR (150 MHz, CDCI,) 6 167.8, 
161.1, 152.9, 151.4, 149.5, 137.9, 137.3, 135.6, 133.2, 130.5, 1262, 
125.0, 122.8, 63.5, 51.6, 34.6, 31.2, 28.5. HRMS (ES+, M + H) calcd 
for C,,H3,N,O, 434.2556, found 434.2555. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)-2-methoxybenzamide, 33. 'H NMR (400 MHz, CD;OD) 
6 8.72 (1H, bs), 8.60 (1H, bs), 8.24 (1H, bs), 7.73 (2H, bs), 7.30 (1H, 
d, J = 7.4 Hz), 7.22 (1H, t, J = 7.4 Hz), 7.11 (2H, d, J = 8.4 Hz), 7.00 
(2H, d, J = 8.3 Hz), 8.87 (1H, t, J = 7.4 Hz), 6.76 (1H, d, J = 8.2 Hz), 
6.25 (1H, s), 3.68 (3H, s), 1.29 (9H, s), 1.15 (9H, s). LC-MS (M + H) 
= 474.2. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)isonicotinamide, 35. 'H NMR (400 MHz, CD;OD) 6 8.57 
(1H, s), 8.50 (3H, m), 7.98 (1H, m), 7.57 (2H, m), 7.51 (1H, m), 7.12 
(4H, m), 6.28 (1H, s), 1.34 (9H, s), 1.14 (9H, s). LC-MS (M + H) = 
445.2. 

N-(tert-butyl)-2-(N-(4-(tert-butyl)phenyl)-2-hydroxyacetamido)- 
2-(pyridin-3-yl)acetamide, 40. 'H NMR (400 MHz, CD;,OD) 6 8.50 
(2H, m), 7.99 (1H, d, J = 6.5 Hz), 7.55 (1H, m), 7.33 (2H, d, J = 9.1 
Hz), 7.15 (2H, bs), 6.12 (1H, s), 3.83 (1H, d, J = 3.0 Hz), 1.29 (9H, 
s), 1.26 (9H, s). LC-MS (M + H) = 398.2. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)-3-hydroxybenzamide, 44. 'H NMR (400 MHz, CD,OD) 
6 8.63 (1H, bs), 8.56 (1H, bs), 8.16 (1H, bs), 7.66 (1H, m), 7.14 (2H, 
d, J = 8.8 Hz), 7.02 (2H, d, J = 8.1 Hz), 6.97 (1H, t, J = 7.9 Hz), 6.78 
(1H, s), 6.74 (1H, d, J = 7.7 Hz), 6.66 (1H, dd, J = 8.2, 1.5 Hz), 6.26 
(1H, s), 1.29 (9H, s), 1.17 (9H, s). LC-MS (M + H) = 460.2. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)pyrimidine-2-carboxamide, 45. 'H NMR (400 MHz, 
CD;0D) 6 8.65 (1H, d, J = 11.8 Hz), 8.54 (1H, d, J = 5.2 Hz), 
8.43 (1H, s), 8.38 (1H, s), 8.10 (1H, d, J = 6.8 Hz), 7.60 (1H, m), 7.13 
(1H, d, J = 8.6 Hz), 7.06 (1H, d, J = 7.9 Hz), 6.32 (1H, s), 1.31 (9H, 
s), 1.15 (9H, s). LC-MS (M + H) = 446.3. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)-4-hydroxybenzamide, 47. 'H NMR (400 MHz, CD,;OD) 
6 8.61 (1H, s), 8.54 (1H, d, J = 5.0 Hz), 8.12 (1H, d, J = 6.4 Hz), 7.63 
(1H, m), 7.19 (4H, m), 7.02 (2H, d, J = 8.2 Hz), 6.54 (2H, d, J = 8.7 
Hz), 6.24 (1H, s), 1.29 (9H, s), 1.20 (9H, s). LC-MS (M + H) = 
460.2. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(pyridin- 
3-yl)ethyl)-1-methyl-1H-imidazole-4-carboxamide, 52. 'H NMR 
(400 MHz, CD;OD) 6 8.81 (1H, s), 8.46 (1H, s), 8.43 (1H, d, J = 
4.76 Hz), 7.78 (1H, d, J = 8.0 Hz), 7.37 (SH, m), 6.23 (1H, s), 5.44 
(1H, s), 3.63 (3H, s), 1.33 (9H, s), 1.29 (9H, s). LC-MS (M + H) = 
448.2. 
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N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-1-(6-methylpyri- 
din-3-yl)-2-oxoethyl)furan-2-carboxamide, 55. 'H NMR (400 MHz, 
CD,OD) 6 8.56 (1H, s), 8.24 (1H, d, J = 8.3 Hz), 7.71 (1H, d, J = 8.3 
Hz), 7.52 (1H, s), 7.41 (2H, d, J = 8.8 Hz), 7.22 (2H, m), 6.28 (1H, 
dd, J = 3.6, 3.3 Hz), 6.23 (1H, s), 5.62 (1H, d, J = 3.5 Hz), 2.69 (3H, 
s), 1.31 (9H, s), 1.27 (9H, s). LC-MS (M + H) = 448.3. 

N-(4-(tert-Butyl) phenyl)-N-(2-(tert-butylamino)-2-oxo-1-(1H- 
1,2,3-triazol-4-yl)ethyl)furan-2-carboxamide, 60. 'H NMR (400 
MHz, CD;OD) 6 7.52 (1H, d, J = 1.2 Hz), 7.44 (1H, s), 7.34 (2H, 
d, J = 8.6 Hz), 7.15 (2H, bs), 6.30 (1H, s), 6.25 (1H, dd, J = 3.6, 1.6 
Hz), 5.46 (1H, d, J = 3.5 Hz), 4.58 (1H, s), 1.34 (9H, s), 1.29 (9H, s). 
LC-MS (M + H) = 424.2. 

Expression and Purification of the SARS-CoV 3CLpro 
Enzyme. An expression construct for the SARS-CoV 3CLpro enzyme 
was designed to produce the exact coding region of the enzyme 
released in virus-infected cells. The expression construct contained a 
hexa-histidine affinity tag and a TEV protease site that is also 
recognized and cleaved by SARS-CoV 3CLpro. The coding region of 
SARS-CoV 3CLpro was codon-optimized and synthesized by BioBasic 
(Canada) and was inserted into a pET expression vector. The enzyme 
was expressed in Escherichia coli BL21(DE3) and was purified via 
multistep purification protocol that employed a cobalt-charged or 
nickel-charged metal-chelate HiTrap affinity column (GE Health 
sciences), DEAE anion-exchange chromatography, and size-exclusion 
chromatography. Purified SARS-CoV 3CLpro was then stored at —80 
°C or used immediately for crystallization. 

HTS Campaign Assays. 3CLpro HTS Assay Protocol. The 
primary assay began with the addition of 4 wL of 3CLpro enzyme 
(150 nM final concentration) in assay buffer (SO mM HEPES, 0.1 mg/ 
mL BSA, 0.01% Triton-X 100, 2 mM DTT) at pH 7.5 into each well of 
a 1536 microtiter plate. Next, 30 nL of test compound in DMSO, 
3CLpro inhibitor (300 uM final concentration) in DMSO, or DMSO 
alone (0.6% final concentration) was added to the appropriate wells. 
The plates were then incubated for 10 min at room temperature. After 
incubation, 1 wL of 3Clpro peptide substrate (2 mM final 
concentration) in 50 mM HEPES at pH 7.5 to each well. After 30 
min of incubation at room temperature, 1 wL of 500 mM acetic acid 
was added to each well to terminate the assay and well fluorescence 
was read on a PerkinElmer Viewlux using fluorescein filters: excitation 
wavelength of 480 nm (with 20 nm bandwidth) and emission 
wavelength of 540 nm (with 20 nm bandwidth). All data was 
normalized to that of the positive control (3CLPro inhibitor) and 
wells containing DMSO only (negative control). The same protocol 
was conducted for single point primary (PubChem AID 1706), 
triplicate point secondary (PubChem AID 1879), and confirmatory 
dose response assays (PubChem AID 1890). 

PLpro Counterscreen Assay Protocol. Prior to assay, PLpro peptide 
substrate and luciferase detection reagent were mixed in assay buffer 
(SO mM HEPES, 0.1 mg/mL BSA, 5 mM DTT, 0.5 mM EDTA and 1 
mM magnesium sulfate) at pH 7.5 and incubated for 60 min. The 
assay was begun by dispensing 2.5 wL of PLpro enzyme (7.5 nM final 
concentration) in assay buffer or assay buffer alone into each well of a 
1536 microtiter plate. Next, 30 nL of test compound in DMSO or 
DMSO alone (0.6% final concentration) was added to the appropriate 
wells. The plates were then incubated for 10 min at room temperature. 
Next, the enzyme reaction was initiated by dispensing 2.5 wL of the 
preincubated mixture containing PLpro peptide substrate and 
luciferase detection reagent (1 uM final substrate concentration). 
Finally, well luminescence was read on a PerkinElmer Viewlux after 60 
min of incubation at room temperature. All data was normalized to 
that of the positive control (no enzyme) and wells containing DMSO 
only (negative control). This protocol was used for confirmatory dose 
response assays (PubChem AID 1944). Detailed assay protocols and 
data generated for HTS and probe development are found at 
PubChem AID 1859: pubchem.ncbi.nlm.nih.gov/assay/assay.cgi?aid= 
1859&loc=ea. 

Crystallization and X-ray Structure Determination of SARS- 
CoV 3CLpro in Complex with Inhibitor 16-(R). Purified SARS- 
CoV 3CLpro was concentrated to 16 mg/mL in a buffer composed of 
20 mM HEPES, pH 7.5, and 5 mM 2-mercaptoethanol. The enzyme 


was incubated on ice with a final concentration of 1 mM inhibitor. A 
1:1 enzyme to crystallization solution ration was used, and the 
crystallization solution consisted of 14.5% PEG 20000, 50 mM MES, 
pH 6.0, 50 mM potassium chloride, and 1% MPD. Crystallization trials 
were set up at room temperature using the method of hanging drop 
vapor diffusion. Significantly better crystal formation was observed in 
the absence of sodium chloride, and crystals were sensitive to small 
fluctuations in pH. Single, well-formed crystals were used for soaking 
in mother-liquor supplemented with 2 mM of inhibitor at room 
temperature for 3 h. MPD (15%) was used as a cryoprotectant for 
freezing the crystals in liquid nitrogen where they were stored until 
synchrotron time was available. Crystals were transferred from liquid 
nitrogen into a stream of dry nitrogen gas at 100 K for X-ray data 
collection. 

X-ray data were collected on a Rayonix-300 CCD detector at the 
Advanced Photon Source, Argonne National Laboratory, on beamline 
21 ID-F at the Life Sciences-Collaborative Access Team (LS-CAT). X- 
ray data were processed and scaled using HKL2000.*” The SARS-CoV 
3CLpro—ML188 complex crystallized as a single monomer in the 
asymmetric unit and in space group C, with unit cell dimensions of a = 
106.73 A, b = 82.67 A, c = $3.12 A, B = 106.0°. The crystal diffracted to 
a resolution 1.95 A. The X-ray intensity data had a final Ryerge Of 6.4%, 
and the data were 97.8% complete overall. Additional data processing 
statistics are provided in Table 4 in Supporting Information. 

X-ray intensities were converted to structure-factor amplitudes by 
the method of French and Wilson using the program TRUNCATE in 
the CCP4 program suite.°’ The initial phases for the model were 
determined by the method of molecular replacement using the 
program MOLREP™ in the CCP4 program suite. The search model 
used for molecular replacement consisted of a monomer of SARS-CoV 
3CLpro from PDB code 2ALV with all side chains intact, and all 
waters and ligands removed. The final and optimal molecular 
replacement solution contained a single monomer in the asymmetric 
unit. 

An initial round of combined positional and B-factor refinement was 
performed with the program REFMAC” in the CCP4 program suite 
using a maximum-likelihood target function and no o cutoff on 
structure factor amplitudes. Initial difference Fourier maps were 
calculated and visualized using the program COOT.°® The initial F, — 
F. difference maps revealed strong (+40) residual electron density 
peaks in the active site for the inhibitor 16-(R). A molecular model for 
the inhibitor was built using the Monomer Library Sketcher program 
in the CCP4 program suite, and a monomer library description was 
created for refinement. Iterative rounds of refinement were performed, 
and water molecules were added manually into strong (+40) difference 
density peaks in the initial refinement stages and into peaks of (+30) 
in the final stages of refinement. During these iterative refinements, 
residual electron density consistent with two DMSO molecules from 
the buffer solution were identified, and these molecules were built into 
residual density and included in all subsequent refinements. 

Iterative refinement using REFMAC was continued until the R,,,., 
and R,,. values plateaued at their lowest values, which were 21.5% and 
27.1%, respectively. At this point, the coordinates for the resulting 
model were submitted to the TLS (translation/libration/screw) server 
to generate a multigroup TLS model.*’ The resulting TLS groups were 
visualized using the molecular viewer on the TLS Web site, and 20 
TLS groups were chosen.°* Two rounds of TLS and restrained 
refinement®” were performed in REFMAC with the weighting term set 
at 0.1. The resulting and final R,,,,, and Rge- values were 18.7% and 
23.5%, respectively, justifying the inclusion of TLS groups in the 
standard refinement protocol.” The final model coordinates have 
been deposited in the PDB under accession code 3V3M.” A summary 
of the final X-ray data refinement statistics are given in Table 4 in 
Supporting Information. 

SARS-CoV Antiviral Activity Assays. The SARS-CoV Urbani 
strain was used in these experiments and was provided by the Centers 
for Disease Control and Prevention. Maintenance of the Vero E6 cells 
used in these studies was achieved using Dulbecco’s minimal essential 
media (DMEM) (Gibco) supplemented with 100 units/mL penicillin, 
100 pg/mL streptomycin (Gibco), and 10% FCS (Altanta Biologicals). 
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All antiviral experiments on SARS-CoV Urbani were carried out at 
Purdue University using a Biosafety Level 3 suite and approved 
biosafety protocols established in collaboration with the Purdue 
University Institutional Biosafety Committee. 

Growth of Vero E6 cells was performed by seeding cells onto flat- 
bottom, 96-well plates at a density of approximately 9 x 10° cells per 
well. Vero E6 cells were either mock-infected with serum-free DMEM 
or infected with 100-fold the median tissue culture infective dose of 
SARS-CoV Urbani per well in 100 wL of serum-free MEM and 
incubated for 1 h at 37 °C with 5% CO,. The viral inoculum was 
removed after the 1 h incubation period and then 100 wL of MEM, 
supplemented with 2% FCS and the 16-(R) inhibitor at concentrations 
ranging from 30 to 0.1 4M, was added. Cells were then incubated for 
48 h at 37 °C with 5% CO,. All controls and each inhibitor 
concentration were set up in triplicate, and the antiviral assays were 
performed independently on at least two separate occasions. Cell 
viability was determined approximately 48 h after infection using the 
CellTiter-Glo luminescent cell viability assay (Promega). 
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